Characterizing color input devices with well-behaved extrapolation

ABSTRACT

The present invention generates a color characterization model for performing transformation from a device-dependent color space of a color device to a device-independent color space. A first set of color measurement data is accessed corresponding to actual measurements of the color device, wherein the actual measurements define a measurement range in the device-dependent color space, and wherein the measurement data includes data point pairs, each data point pair having corresponding device-dependent values and device-independent values. Next, a second set of data point pairs is generated based on a predesignated set of device-dependent values outside the measurement range, by extrapolating device-independent values from the first set of color measurement data. The color characterization model is then determined based on both the first set of color measurement data and the generated second set of data point pairs. Because the color characterization model is determined based on actual measurements and extrapolated values, the color characterization model is well-behaved and does not exhibit significant overshooting or undershooting beyond the measurement range.

BACKGROUND OF THE INVENTION

1. Field of the Invention

This invention relates generally to the field of color characterization of color input devices, and specifically relates to generating a color characterization model for performing transformation from a device-dependent color space to a device-independent color space. The color characterization model is based on measured color data points.

2. Description of the Related Art

Traditional color management systems (CMSs) use color characterizations of color input devices to derive color transformations that transform color coordinates between a device-dependent color space that depends on the device, and a device-independent color space. In particular, color characterization for a color input device such as a camera or scanner can be derived by first capturing a target consisting of predesignated color patches. In a device-dependent color space such as the RGB color space, this results in an RGB bitmap image in which the color of each patch is encoded in an RGB value.

Given a particular target, the color transformation between the device-dependent and device-independent color spaces can be modeled reasonably by polynomials of low degrees. The mathematical method or technique for fitting these polynomials is known as polynomial regression.

A problem arises in the use of polynomial regression based on measured data points. In particular, the use of polynomials tends to overshoot or undershoot beyond the range of the measured data points in a given space. An aspect of this problem is that the device-independent color space typically has constraints associated with it that are imposed by the physical range of the quantities being modeled. In other words, a color in the device-independent color space is typically represented by three coordinates. These three coordinates must satisfy certain constraints and relationships among them to represent a physically possible color. To impose these constraints, domain knowledge rather than the use of purely statistical techniques is required.

If the measured data points from the color input device uniformly fill in the device-dependent color space, the problem of overshooting or undershooting beyond the range of the data points does not arise. However, data points measured from a standard target are rarely distributed uniformly in the device-dependent color space. For instance, measured RGB points typically fill up the inner portions of the RGB cube, or reside in a lower dimensional manifold of the RGB cube. When this happens, the regressed polynomials based on the measured data points are inaccurate in predicting device-independent values beyond the range of the measured data points.

SUMMARY OF THE INVENTION

The present invention addresses the foregoing problems by imposing boundary conditions in the color space of the color device on which polynomial regression is performed. In particular, the invention generates a set of boundary data points that are outside a measurement range of the captured data set, which preferably correspond to axes or corner points of the device-dependent color space of the color input device, and obtains a color characterization based on a best fit to both the actual measurements and the generated data points.

In one aspect, the present invention generates a color characterization model for performing transformation from a device-dependent color space of a color device to a device-independent color space. A first set of color measurement data is accessed corresponding to actual measurements of the color device, wherein the actual measurements define a measurement range in the device-dependent color space, and wherein the measurement data includes data point pairs, each data point pair having corresponding device-dependent values and device-independent values. Next, a second set of data point pairs is generated based on a predesignated set of device-dependent values outside the measurement range, by extrapolating device-independent values from the first set of color measurement data. The color characterization model is then determined based on both the first set of color measurement data and the generated second set of data point pairs.

Preferably, the second set of data point pairs is generated by extrapolating the device-independent values from the first set of color measurement data based on distance and hue angle. Shepard extrapolation may be used. The first set of color measurement data is preferably obtained from a measurement-only color profile corresponding to the color device.

The color characterization model is preferably determined by either multivariate linear regression or nonlinear regression. In addition, the predesignated set of device-independent values preferably corresponds to axes or corner boundaries of the device-dependent color space. The device-dependent color space of the color device is preferably the RGB space. The device-independent color space is preferably any one of the CIELUV space, the CIELAB space or the JAB space. The color characterization model is preferably usable by a color management system.

Because the present invention introduces a set of boundary data points that are outside of the measurement range of the first set of color measurement data, extra penalty is imposed on undesirable polynomials that overshoot or undershoot near device-dependent color space boundaries. In addition, the added data points allow for constraints which are imposed by the physical range of the color space being modeled to be considered.

This brief summary has been provided so that the nature of the invention may be understood quickly. A more complete understanding of the invention can be obtained by reference to the following detailed description of the preferred embodiment thereof in connection with the attached drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a view showing the appearance of one embodiment of the invention.

FIG. 2 is a block diagram depicting an example of an internal architecture of the FIG. 1 embodiment.

FIG. 3 is a block diagram depicting a color management module which can carry out a method of generating a color characterization model according to the present invention.

FIG. 4 is a flowchart that illustrates generating a color characterization model for a color input device according to the present invention.

FIG. 5 is a representational diagram depicting the generation of additional data points according to the present invention.

FIG. 6 is a flowchart illustrating the generation of additional data points according to the present invention.

DETAILED DESCRIPTION OF THE INVENTION

Referring to FIG. 1, a view showing the exterior appearance of one embodiment of the invention is shown. Specifically, FIG. 1 depicts computing equipment 100, which includes host processor 103 comprising a personal computer (hereinafter “PC”). Provided with computing equipment 100 are color monitor 101 including display screen 107 for displaying text and images to a user, keyboard 106 for entering text data and user commands into PC 103, and pointing device 111. Pointing device 111 preferably comprises a mouse, for pointing, selecting and manipulating objects displayed on display screen 107.

Computing equipment 100 includes a computer readable memory medium such as floppy disk drive 108, fixed disk 110, and/or CD-ROM drive 109. Such computer readable memory media allow computing equipment 100 to access information such as image data, computer-executable process steps, application programs, and the like, stored on removable and non-removable memory media. In addition, network access 105 allows computing equipment 100 to acquire information, images and application programs from other sources, such as a local area network or the Internet.

Digital scanner 104 and digital camera 102 are both color input devices for which a color characterization model can be generated according to the present invention. Digital color scanner 104 is provided for scanning documents and images and sending the corresponding image data to computing equipment 100. Digital color camera 102 is provided for sending digital image data to computing equipment 100. Of course, computing equipment 100 may acquire digital image data from other color input devices, such as a digital video camera.

FIG. 2 is a block diagram illustrating the internal architecture of the FIG. 1 embodiment. As shown in FIG. 2, PC 103 includes network interface 202 for network access 105, and a central processing unit (“CPU”) 201, that interfaces with computer bus 200. Also interfacing with computer bus 200 are fixed disk 110, random access memory (“RAM”) 207 for use as main memory, read only memory (“ROM”) 208, floppy disk interface 209 to allow PC 103 to interface with floppy disk drive 108, display interface 210 for interfacing with monitor 101, keyboard interface 203 for interfacing with keyboard 106, mouse interface 204 for interfacing with pointing device 111, scanner interface 205 for interfacing with scanner 104, and camera interface 206 for interfacing with digital camera 102.

Main memory 207 interfaces with computer bus 200 so as to provide quick RAM storage to CPU 201 during execution of software programs such as the operating system application programs, and device drivers. More specifically, CPU 201 loads computer-executable process steps from fixed disk 110 or other memory media into a region of main memory 207 in order to execute software programs. Data such as color measurement data can be stored in main memory 207, where the data can be accessed by CPU 201 during execution.

Read only memory 208 stores invariant computer-executable program code, or program or process steps, for basic system functions such as basic input and output (I/O), startup, or reception of keystrokes from keyboard 106.

As also shown in FIG. 2, fixed disk 110 stores computer-executable code for application programs 211 such image processing programs like Adobe® Photo shop™.

Fixed disk 110 also stores color management module (CMM) 218. CMM 218 renders color image data from a device-dependent color space to a device-independent color space, and vice versa. CMM 218 uses measurement data from color measurement profiles to generate the device transforms necessary to transform color image data into the color space of the destination color image data.

Forward model 219 is a data structure by which color behavior of a color device is modeled, and performs the transformation from the device-dependent color space of a color input device to a device-independent color space. Forward model 219 can be embodied into a single device driver such as scanner driver 213 or camera driver 214. The generation of forward model 219, being the color characterization model for performing transformation from a device-dependent color space of a color input device to a device-independent color space, is described in more detail below.

It is also possible to implement CMM 218 according to the invention as a dynamic link library (“DLL”), or as a plug-in to other application programs such as image manipulation programs like the Adobe® Photoshop™ image manipulation program, or as part of scanner driver 213 or camera driver 214.

Fixed disk 110 further stores computer-executable code for monitor driver 212, scanner driver 213, camera driver 214, other device drivers 215, image files 216 and other files 217.

FIGS. 1 and 2 illustrate one example of a computing system that executes program code, or program or process steps, configured to generate a color transform using a data structure by which behavior of a color input device is modeled. Other types of computing systems may also be used.

With reference to FIG. 3, a block diagram depicting CMM 218, which carries out a method of generating a color characterization model according to the present invention, is shown. CMM 218 contains color input characterization module 301 for characterizing color input device 300, as well as color output device profile 305 for defining a color characterization for color output device 306. CMM 218 operates to accept color values in device-dependent coordinates (such as scanner RGB coordinates), apply the color characterization model and output corresponding color values in the device-independent coordinates (such as Luv). CMM 218 also operates to transform the device-independent coordinates into device-dependent coordinates (such as printer CMYK or monitor RGB) based on color output device profile 305. CMM 218 also operates to perform gamut-mapping between the device-dependent and device-independent color spaces.

Color input characterization module 301 contains process steps for an access data module 302 to access sampled color measurement data of color input device 300, including data point pairs of measured device-dependent values and their corresponding device-independent values. Color input characterization module 301 also contains process steps for a generate additional points module 303 for generating additional points outside a range defined by the actual measurements of the color input device. In addition, color input characterization module 301 contains process steps for a determine forward model module 304 for determining a color characterization model (forward model 219) based on the original color measurement data and the additional generated data points.

Color input characterization module 301, access data module 302, generate additional points module 303, determine forward model module 304, and color output device profile 305 can all be embedded within CMM 218, or can be separate application programs accessed by CMM 218, or can be combined into a single application program accessed by CMM 218. In addition, image data can be input from color input device 300 to CMM 218 using forward model 219. In this embodiment, color input device 300 is scanner 104 or digital camera 102, although a different color input device such as a digital video camera can be used.

Color characterization of color input device 300 generally consists of capturing an image of a target consisting of color patches with known color values in a device-independent color space. Popular choices of such a target include the IT8.7 target and the Color Checker. The result of the capture is an device-dependent bitmap image in which the color of each patch is encoded in a device-dependent value such as RGB. Data point pairs having corresponding RGB values and Luv values make up the color measurement data, and define a measurement range. The measurement range typically does not extend to the boundaries of the RGB color space cube.

These color measurement data are specific to color input device 300, and the goal of calorimetric characterization is to establish an empirical relationship between RGB values and color values in a device-independent color space such as CIELUV. More specifically, a mathematical transformation is sought from RGB to Luv that models as accurately as possible the behavior of color input device 300. Such a transformation can be modeled reasonably well by polynomials of low degrees.

In the preferred embodiment, RGB is the device-dependent color space, Luv is the device-independent color space, and a 20-term cubic polynomial is used as the color characterization model for color input device 300. However, other device-dependent and device-independent color spaces can be used, as well as polynomials with a different number of terms. Given the 20-term cubic polynomial, coefficients λ_(i), α_(i), β_(i) are sought such that the following equations satisfy the least squares error condition on the data points.

$\begin{matrix} {{\hat{L}\left( {R,G,B} \right)} = {\lambda_{1} + {\lambda_{2}R} + {\lambda_{3}G} + {\lambda_{4}B} + {\lambda_{5}R^{2}} + {\lambda_{6}{RG}} + {\lambda_{7}{RB}} +}} \\ {{\lambda_{8}G^{2}} + {\lambda_{9}{GB}} + {\lambda_{10}B^{2}} + {\lambda_{11}R^{3}} + {\lambda_{12}R^{2}G} + {\lambda_{13}R^{2}B} +} \\ {{\lambda_{14}{RG}^{2}} + {\lambda_{15}{RGB}} + {\lambda_{16}{RB}^{2}} + {\lambda_{17}G^{3}} + {\lambda_{18}G^{2}B} +} \\ {{\lambda_{19}{GB}^{2}} + {\lambda_{20}B^{3}}} \end{matrix}$ $\begin{matrix} {{\hat{u}\left( {R,G,B} \right)} = {\alpha_{1} + {\alpha_{2}R} + {\alpha_{2}R} + {\alpha_{3}G} + {\alpha_{4}B} + {\alpha_{5}R^{2}} + {\alpha_{6}{RG}} +}} \\ {{\alpha_{7}{RB}} + {\alpha_{8}G^{2}} + {\alpha_{9}{GB}} + {\alpha_{10}B^{2}} + {\alpha_{11}R^{3}} + {\alpha_{12}R^{2}G} +} \\ {{\alpha_{13}R^{2}B} + {\alpha_{14}{RG}^{2}} + {\alpha_{15}{RGB}} + {\alpha_{16}{RB}^{2}} + {\alpha_{17}G^{3}} +} \\ {{\alpha_{18}G^{2}B} + {\alpha_{19}{GB}^{2}} + {\alpha_{20}B^{3}}} \end{matrix}$ $\begin{matrix} {{\hat{v}\left( {R,G,B} \right)} = {\beta_{1} + {\beta_{2}R} + {\beta_{3}G} + {\beta_{4}B} + {\beta_{5}R^{2}} + {\beta_{6}{RG}} + {\beta_{7}{RB}} +}} \\ {{\beta_{8}G^{2}} + {\beta_{9}{GB}} + {\beta_{10}B^{2}} + {\beta_{11}R^{3}} + {\beta_{12}R^{2}G} + {\beta_{13}R^{2}B} +} \\ {{\beta_{14}{RG}^{2}} + {\beta_{15}{RGB}} + {\beta_{16}{RB}^{2}} + {\beta_{17}G^{3}} + {\beta_{18}G^{2}B} +} \\ {{\beta_{19}{GB}^{2}} + {\beta_{20}B^{3}}} \end{matrix}$

These polynomials should minimize the sum of squares of errors: SSE=Σ({circumflex over (L)}(R _(i) , G _(i) , B _(i))−L _(i))²+(û(R _(i) , G _(i) , B _(i))−u _(i))²+({circumflex over (v)}(R _(i) , G _(i) , B _(i))−v _(i))²), where the data point pairs are (R_(i), G_(i), B_(i)) and (L_(i), u_(i), v_(i)). The least squares problem can be stated as solving the following matrix equation, where N is the number of color measurement data points:

${\begin{pmatrix} 1 & R_{1} & G_{1} & B_{1} & \ldots & {G_{1}B_{1}^{2}} & B_{1}^{3} \\ 1 & R_{2} & G_{2} & B_{2} & \ldots & {G_{2}B_{2}^{2}} & B_{2}^{3} \\ \vdots & \vdots & \vdots & \vdots & \ldots & \vdots & \vdots \\ 1 & R_{N} & G_{N} & B_{N} & \ldots & {G_{N}B_{N}^{2}} & B_{N}^{3} \end{pmatrix}\begin{pmatrix} \lambda_{1} \\ \lambda_{1} \\ \vdots \\ \lambda_{20} \end{pmatrix}} = \begin{pmatrix} L_{1} \\ L_{1} \\ \vdots \\ L_{N} \end{pmatrix}$ or, more compactly, RΛ=L.

Typically, N is larger than 20, so the above-equation is over-determined, necessitating a least squares solution. A closed form for the solution for Λ can be given by: Λ=(R ^(T) R)⁻¹(R ^(T) L)

In practice, the closed form solution is not used. Instead, for a more numerically stable algorithm, the least squares solution is obtained by first decomposing the matrix R into a matrix product (for example, using singular value decomposition), followed by back substitution.

As noted above, the use of polynomials tends to overshoot or undershoot beyond the range of the measured data points in a region beyond the range of the measured data points. An aspect of this problem is that the Luv color space typically has constraints associated with it that are imposed by the physical range of the quantities being modeled. In other words, a color in the device-independent color space is typically represented by three coordinates. These three coordinates must satisfy certain constraints and relationships among them to represent a physically possible color. To impose these constraints, domain knowledge rather than the use of purely statistical techniques is required. The present invention addresses the foregoing by introducing boundary points outside of the measurement range of the color measurement data.

Referring now to FIG. 4, a flowchart that illustrates generating a color characterization model for a color input device according to the present invention is shown. Following start bubble S400, color measurement data is accessed in step S401, additional points are generated in step S402 based on the color measurement data, and forward model 219 is determined using both the color measurement data and the generated data points in step S404. The generation of additional points is described in more detail in relation to FIGS. 5 and 6.

With reference to FIG. 5, a representational diagram depicting the generation of additional data points according to the present invention is shown. In the preferred embodiment, eight control points are introduced corresponding to the corners of the RGB color space cube. These points are typically outside of the measurement range of the color measurement data. If the values for color input device 300 are normalized to unity, then the RGB values for these control points are:

-   -   R=0, G=0, B=0     -   R=0, G=0, B=1     -   R=0, G=1, B=0     -   R=0, G=1, B=1     -   R=1, G=0, B=0     -   R=1, G=0, B=1     -   R=1, G=1, B=0     -   R=1, G=1, B=1

Luv values corresponding to each of the listed (R, G, B) control points are then determined. In determining the Luv values, the hue of the (R, G, B) color should be considered.

Generally, for a given (R, G, B) from above, a weight is assigned to each of the (R_(i), G_(i), B_(i)) within a neighborhood of (R, G, B) in the sampled data set. Weight is determined based on two criteria. First, the weight is inversely proportional to the distance between (R, G, B) and (R_(i), G_(i), B_(i)). Second, data points having a significantly different hue than the given (R, G, B) point should be discarded, meaning the weight for those data points should be set to 0. To take the hue into account, only points that lie within a cone whose vertex is at (0, 0, 0) are considered, whose axis coincides with the line joining (0, 0, 0) to (R, G, B), and whose semi-vertical angle θ satisfies cos θ=0.9, if such points exist.

With reference to FIG. 6, a flowchart illustrating the generation of additional data points according to the present invention is shown. Following start bubble S600, the Luv value for white R=G=B=1) is assigned L=100 and u=v=0 in step S601. Then, in step S602, a determination is made as to whether all the Luv values corresponding to the eight corners of the RGB cube have been assigned. If the answer to this inquiry is yes, step S602 is followed by end bubble S613. Otherwise, in step S603, an RGB point without a corresponding Luv value is selected. Cos θ is initialized to 0.9 in step S604, followed by a decision as to whether cos θ is greater than or equal to 0 in step S605. If the answer to this inquiry is no, an error condition occurs in end bubble S614. Otherwise, a maximum radius value MAX_R is initialized to a maximum distance (MAXD) divided by 5 if cos θ exceeds 0, or MAX_R is initialized to MAX_D if cos θ equals 0. MAX_D equals 3 in the case of 1-norm. In step S607, a radius value R is initialized to 0.1. This is followed by an inquiry in step S608 as to whether R is less than or equal to MAX_R. If the answer to this inquiry is no, cos θ is decremented by 0.05 and step S605 is revisited. Otherwise, sample points are collected for a set S in step S609. Then, in step S610, a determination is made as to whether set S is empty. If set S is not empty, an Luv value is calculated based on set S. The calculation of the Luv value is discussed in more detail below. If set S is empty, R is incremented by 0.1 in step S611, and step S608 is revisited. This repeats until set S is not empty.

After yielding a non-empty set S of points (R_(i), G_(i), B_(i)) and corresponding (L_(i), u_(i), v_(i)), then for each such point, a normalized weight w_(i) is assigned as follows:

$h_{i} = {\left( \frac{\max\left( {{{MAX\_ D} - d_{i}},0} \right)}{d_{i}} \right)^{2}\mspace{14mu}{where}}$ d_(i) = R − R_(i) + G − G_(i) + B − B_(i) $w_{i} = {h_{i}/{\sum\limits_{j \in S}h_{j}}}$ where h_(i) is the weight before normalization and satisfies the inverse-distance property, and d_(i) is the distance of (R, G, B) from (R_(i), G_(i), B_(i)).

The extrapolated Luv value for (R, G, B) is then calculated by:

$L = {\sum\limits_{i \in S}{w_{i}L_{i}}}$ $u = {\sum\limits_{i \in S}{w_{i}u_{i}}}$ $v = {\sum\limits_{i \in S}{w_{i}v_{i}}}$

Referring back to FIG. 4, once the additional data points are generated, multivariate linear regression is performed on the N+8 data points (color measurement data points+generated data points). This results in a color characterization model (forward model 219) for input color device 300.

By introducing boundary data points outside of the measurement range of the original color measurement data, extra penalty is imposed on the polynomials that overshoot or undershoot near polynomial boundaries. In addition, the generated data points allow for constraints which are imposed by the physical range of the color space being modeled to be considered.

The invention has been described above with respect to particular illustrative embodiments. It is understood that the invention is not limited to the above-described embodiments and that various changes and modifications may be made by those skilled in the relevant art without departing from the spirit and scope of the invention. 

1. A method for generating a color characterization model for a color input device, the color characterization model for performing transformation from a device-dependent color space of the color input device to a device-independent color space, the method comprising the steps of: accessing a first set of color measurement data corresponding to actual measurements of a color target by the color input device, wherein the color target has known color values in the device-independent color space and the actual measurements are obtained by the color input device in the device-dependent color space, wherein the actual measurements define a measurement range in the device-dependent color space, and wherein the measurement data includes data point pairs, each data point pair having corresponding actual measurements and known color values; generating a second set of data point pairs based on plural device-dependent values outside the measurement range, by extrapolating device-independent values corresponding to the device-dependent values outside the measurement range from the first set of color measurement data; and determining the color characterization model based on both the first set of color measurement data and the generated second set of data point pairs.
 2. The method according to claim 1, wherein said generating step comprises extrapolation of device-independent values from the first set of color measurement data based on distance and hue angle.
 3. The method according to claim 1, wherein the first set of color measurement data is obtained from a measurement-only color profile corresponding to the color input device.
 4. The method according to claim 1, wherein the color characterization model is usable by a color management system.
 5. The method according to claim 1, wherein said determining step comprises multivariate linear regression.
 6. The method according to claim 1, wherein said determining step comprises nonlinear regression.
 7. The method according to claim 1, wherein the plural device-independent values outside the measurement range includes device dependent values at at least one corner point of the device-dependent color space.
 8. The method according to claim 1, wherein the plural device-dependent values outside the measurement range correspond to eight corner points of the device-dependent color space.
 9. The method according to claim 1, wherein the plural device-dependent values outside the measurement range correspond to at least one axis boundary of the device-dependent color space.
 10. The method according to claim 1, wherein said generating step comprises extrapolation using Shepard extrapolation.
 11. The method according to claim 1, wherein the device-dependent color space is RGB.
 12. The method according to claim 1, wherein the device-independent color space is CIELUV.
 13. The method according to claim 1, wherein the device-independent color space is CIELAB.
 14. The method according to claim 1, wherein the device-independent color space is JAB.
 15. The method according to claim 1, wherein the color input device is a color scanner.
 16. The method according to claim 1, wherein the color input device is a digital color camera.
 17. A computer-readable storage medium which stores computer-executable process steps, the computer-executable process steps executable by a computer to generate a color characterization model for a color input device, the color characterization model for performing transformation from a device-dependent color space of the color input device to a device-independent color space, said computer-executable process steps comprising process steps executable to perform a method according to any of claims 1 to
 16. 18. An apparatus for generating a color characterization model for a color input device, the color characterization model for performing transformation from a device-dependent color space of the color input device to a device-independent color space, the apparatus comprising: obtaining unit configured to obtaining a first set of color measurement data corresponding to actual measurements of a color target by the color input device, wherein the color target has known color values in the device-independent color space and the actual measurements are obtained by the color input device in the device-dependent color space, wherein the actual measurements define a measurement range in the device-dependent color space, and wherein the measurement data includes data point pairs, each data point pair having corresponding actual measurements and known color values; generating unit configured to generate a second set of data point pairs based on plural device-dependent values outside the measurement range, by extrapolating device-independent values corresponding to the device-dependent values outside the measurement range from the first set of color measurement data; and determining unit configured to determine the color characterization model based on both the first set of color measurement data and the generated second set of data point pairs. 